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Introduction & Background 


Sometimes it is very necessary for musicians and music fans to identify 
notes from music pieces. For a single note, this might be very easy even just 
by hearing. However, if it is an orchestra, then it is nearly impossible to 
identify every note being played. To help people with this need, we decided 
to use our knowledge in signal processing to write a Matlab program that 
could recognize notes in a melody. 


General background: 


For any sound played by an instrument, the pitch of the sound is determined 
by the fundamental frequency, while the relative strength of individual 
harmonics determines the timbre. Odd order harmonics work against 
fundamental frequency, whereas even order harmonics enforce the 
fundamental frequency to make the sound smoother. Human ears find even 
order harmonics more comfortable. 


And for different musical instruments, if they play the same note, the 
fundamental frequency of their spectrum would be the same, but the values 
of harmonics will be different. The varying height of those harmonics 
determines the unique timbre of the instrument, but the harmonics will all 
appear at fixed frequencies. So it is plausible to identify the note by looking 
at its spectrum and find corresponding fundamental frequency without 
worrying about which instrument played that note. 


algorithm for identifying the notes 
Algorithm For Identifying the Notes 
Step 1: Convert the music segment into frequency domain 


Because our recorded music files are chopped into fixed finite length 
segments, all we have to do is apply DFT to the sample. Considering speed 
of computation, we decided to use the method of computing with the DFT 
matrix . 


We know that DFT samples are defined by 


Where 
Thus can be expressed in matrix form as 
Where 


By using the matrix method to compute the DFT, we can calculate the DFT 
of many pieces of same length and same sample rate music segments at the 
same time using the same matrix. This will increase the overall speed of our 
program and enable tune identification for lengthy music that consists of 
thousands of segments to be analyzed. 


Step 2: Find the peaks in the frequency domain representation of the 
segmented music. Identify the peaks whose frequencies are near integer 
multiples of each other and group them into the same category. 


We know that notes played by instruments have harmonics which the 
frequencies are integer multiples of the base frequency. Often the note that’s 
being played is the base frequency or the first odd harmonic of the base 
frequency. Thus, peaks which frequencies are near integer multiples of each 
other indicate that they are mostly likely to be the harmonics of a base 
frequency and the note that’s being played lies within it. 


To identify the harmonics we used findpeaks function in Matlab and 
grouped them in a matrix which the i-jth entry is the frequency of the jth 
peak over the frequency of the ith peak. We then look for near integer 
entries in the matrix to determine peaks whose frequencies have integer 
multiples relationship and categorize them together. The peak with the 
lowest frequency in each category is the base band while the other peaks in 
the same category are its harmonics. 


Step3: Use the power of peaks to determine the most significant categories. 


Because there may be noise and other random vibrations when an 
instrument is played, there are categories which do not contain the note 
that’s played by the instrument. On the other hand, when multiple notes are 
played at the same time (aka a chord), there may be multiple categories 
which contain notes that are played. To correctly analyze the notes that are 
being played, we pick out the category which has the most power and 
categories whose powers are larger than one-tenth of that. The way we 
calculated the power of each category is by summing up the 
power(frequency*peak/2) of all the harmonics in that category. After this 
process, ambient noise and vibrations should be excluded and the remaining 
categories should contain the notes that are being played. 


Step 4: Determine the notes that lie in each category. 


Mostly, the base frequency of each category is the note that’s played by the 
instrument. However, we’ve learned that the note played by some 
instruments is either the base frequency or the first odd harmonic of the 
base frequency. To determine which note is being played, we compared the 
sum of the power of odd harmonics versus the sum of the power of the even 
harmonics in each category. When the power of even harmonics is larger 
than a certain times of the power of the odd harmonics, we say that the note 
is the base frequency; otherwise, we say that the note is the first odd 
harmonic of the base frequency. 


Testing of the program 
Test of the program: 


We tested our program with 6 signals. For generality, we arbitrarily chose 
signals of different instruments, such as guitar, violin and piano, and signals 
of different complexity (single note, chord with 3 notes and chord with 5 
notes). Among all these tests, 5 out of 6 succeeded. In the failed test, the 
input signal was a dual-note chord: A4 and E5 by piano. The program 
successfully recognized E5 but failed to identify A4. After analyzing we 
think the reason is that the 3™ harmonic of A4 is very close to the 2" 
harmonic of E5, so this gives an error when the program tries to distinguish 
the two. But other than that, the program works fine. It demonstrates the 
ability to identify notes in a melody, with or without noise, and by different 
instruments. 


The following are two trials we made with details. 
I. Trial with a single note: 


The input is a single piano B4 note with duration of about 0.1 sec. It is 
being extracted from a wav file. We can see its spectrum and time domain 
waveform below: 
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Note that in our program we use 44K Hz as the sample frequency, but the 
test signal may be sampled in a different frequency. (Like this one, being 
sampled with 11025 Hz) 


To fit that signal in the program, we need to resample the input with the rate 
44k Hz. This can be realized with the command: 


input1=resample (input, 44000, fs); 
With fs being the original sample frequency of the input. 
As shown below, the program successfully detected the pitch of the note. 


And the program also returns the first 8 peaks of the harmonics of the note. 
B 


ans = 


24.0000 1.0000 0.1961 0 0 0 0 0 0 


II. Trial with a chord of three notes 


This time we change the input to a chord with notes: E3, G3 and C4, played 
by a guitar. Its spectrum and time domain waveform are shown below: 
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Again we convert the sample rate to 44k Hz and the program returns the 
correct results. Each column of the matrix represents the amplitudes of 
harmonics of corresponding note. 
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Difficulty and application 
Difficulty encountered: 


In the frequency domain, a given single note is composed of a fundamental 
frequency and many harmonics which has frequencies that are multiples of 
the fundamental frequency. As a result, the identification of a single note 
would be relatively easy. However, when we observe the spectrum of a 
chord containing more than three notes, it is very hard to say exactly what 
the three notes are by one's eyes. The difficulty is that when multiple notes 
are played at the same time, the frequency spectrum of the signal becomes 
very messy and the spectrum of different notes somehow mixes together 
and become something blurry with all the notes hidden behind it. In order to 
solve this problem, we need to develop an algorithm that can still detect the 
characteristics of each note even when they intervene with each other and 
make each other ‘invisible’ on the spectrum.Application 


Application 


The most obvious application of tone identification is to help students 
learning music. If a student wants to play a piece of music without an actual 
music sheet, it would be very frustrating for being unable to identify tones 
in the music. This is where our tone-identifier can come in handy. The 
program can help the students to convert the audio signal he hears into 
visible music notes by just simply input the sound signal to the program. 
The program can also be helpful for professionals in music-relate field if we 
somehow improve the codes and algorithm. For example, if we improve the 
algorithm and optimize the codes we can make the program instantaneously 
convert audio signal to music notes. If we make this happen, a composer 
can greatly benefit from it since the program can recognize what the 
composer is playing and output the music sheet instantly. Further more, we 
can also make the program more sensitive to pitch. By this we mean that 
with a more detailed notes chart embedded in the program we can make the 
program to detect how close a played note is to the targeted frequency. And 
with this program in hand, tuner can work more efficiently 


